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1 Query evaiuation teohniouesS for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 



7 8 9 10 nex 



Full text available: ^.pMSJ^a.MB). 



Additional Information: MLsJMiQn, si^^^Strgct, XBferenA 



Database management systems will continue to manage large data volumes. Thus, efficient algor 
will be required to provide acceptable performance. The advent of object-oriented and extensible 
modern data models exacerbate the problem: In order to manipulate large sets of complex object 
records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible databas" 
operator model of parallelization, parallel algorithms, relational database systems, set-matching c 



Externa! memory algorithms and data structures: dealing with 




Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), Volume 33 issue 2 

Full text available: ■^.pdilM^^^M.I^^^^ Additional Information: ful] cMioQ, si?sMot, Mereni 

Data sets in large applications are often too massive to fit completely inside the computers intern, 
between fast internal memory and slower external memory (such as disks) can be a major perfor 
in the design and analysis of external memory (or EM) algorithms and data structures, where the 
consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external memory, hie 
memory, online, out-of-core, secondary storage, sorting 
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Christian Bohm, Stefan Berchtold, Daniel A. Keim 

September 2001 ACM Computing Surveys (CSUR), volume 33 issue 3 

Full text available: ■^.pdgl.MMSO. Additional Information: iM/^itiition, ?!bslracL rtDfi-ireiic 

During the last decade, multimedia databases have become increasingly important in many applic 
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biology. An important research issue in the field of multimedia databases is the content-based ret 
videos. However, in contrast to searching data in a relational database, a content-based retrieval 
the database system ... 

Keywords: Index structures, indexing high-dimensional data, multimedia databases, similarity s* 



* jnye|1ediiles,yersus.sign 

Justin Zobel, Alistair Moffat, Kotagiri Ramamohanarao 

December 1998 ACM Transactions on Database Systems (TODS), volume 23 issue a 

Full text available: '^i)c[iX24*L62.KS) Additional Information: fyjj.cRaii.m r§lMi?.CU 

Two well-known indexing methods are inverted files and signature files. We have undertaken a d€ 
text indexing, paying particular attention to query evaluation speed and space requirements. We 
experimentation and a refined approach to modeling of signature files, and demonstrate that inve 
inverted files be used to ev ... 

Keywords: indexing, inverted files, performance, signature files, text databases, text indexing 



^ Searching in metric spacesS 

Edgar Chavez, Gonzalo Navarro, Ricardo Baeza-Yates, Jose Luis Marroquin 
September 2001 ACM Computing Surveys (CSUR), volume 33 issue 3 

Full text available: *^.pdX(GJ.6,G4 KSi Additional Information: Ml citation, abstract, referenc 

The problem of searching the elements of a set that are close to a given query element under son 
branches of computer science, from pattern recognition to textual and multimedia information ret 
similarity criterion defines a metric space, instead of the more restricted case of a vector space. ^ 
cases without cros ... 

Keywords: Curse of dimensionality, nearest neighbors, similarity searching, vector spaces 

^ Compuling..cujjjcuj.a,20^^^ 

September 2001 Journal on Educational Resources in Computing (JERIC) 

Full text available: "^pdfl61M3.i<B)-.i^.-^ Additional Information: M cjtetion, referen^^?. J^iD^s, jnd 

^ Pgn. compiiting;..aiech 
Andre Meyer 

July 1995 ACM SIGCHI Bulletin, Volume 27 issue 3 

Full text available: '^ pdf(5.14 MBi Additional Infornnation: U;W cit<3tiori . ab^iiract, ch:=f;^^$:, 

This work gives an overview of a new technology that is attracting growing interest in public as W' 
other technologies is in the use of a pen or pencil as the primary means of interaction between a 
interface metaphor. From this follows a set of consequences that will be analyzed and put into cor 
with a short historic ... 

^ Bitmapjndex.design.an^ 

Chee-Yong Chan, Yannis E. loannidis 

June 1998 ACM SIGMOD Record , Proceedings of the 1998 ACM SIGMOD international 

Full text available: "^.Rdt^vlvllMB). Additional Information: .tM.CJtation, si?str3Qt referent 

Bitmap indexing has been touted as a promising approach for processing complex adhoc queries i 
systems. Nevertheless, only few possible bitmap schemes have been proposed in the past and ve 
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In this paper, we present a general framework to study the design space of bitmap indexes for se 
characteristics that the various al ... 

9 On randomization in sequi3ntiai and distributed algorithnis 
Rajiv Gupta, Scott A. Smolka, Shaji Bhaskar 

March 1994 ACS^ Computing Surveys (CSOIR), volume 26 issue i 

Full text available: "^.p.^jliBJllMB.}, Additional Information: .M.dMion, .^i?stract, mferen^ 

Probabilistic, or randomized, algorithms are fast becoming as commonplace as conventional deter 
have been widely used in the design of randomized algorithms. These techniques are illustrated u 
distributed— that span a wide range of applications, including :prlmality testing (a classical proble 

keywords: Byzantine agreement, CSP, analysis of algorithms, computational complexity, dining 
isomorphism, hashing, interactive probabilistic proof systems, leader election, message routing, n 
probabilistic techniques, randomized or probabilistic algorithms, randomized quicksort, sequential 



integrated„docyment„cach 
Achlm Kraiss, Gerhard Weikum 

August 1998 The VLDB Journal — The International Journal on Very Large Data Bases, ^ 

Full text available: pdf(6{}3.01 KB) Additional Information: I-;!! oitation . ab^ilnKii. cW}r\Qi\ . 

Large multimedia document archives may hold a major fraction of their data in tertiary storage lit 
approach to the vertical data migration between the tertiary, secondary, and primary storage in tl 
latency of the tertiary storage, with the replacement policy of the document caches at the second 
interaction of these policies with ... 

GCeywords: Caching, Markov chains. Performance, Prefetching, Scheduling, Stochastic modeling. 



Hanan Samet 

June 1984 ACM Computing Surveys (CSOB^), volume 16 issue 2 

Full text available: 'g|.pc{iitl87.M3) Additional Infornnatlon: &:!!.cliatjon, ?;$;fei:ence|?. d\\nm, MmJMsm 



''^ E)ata ckistenng;..a.re^^ 

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSOR), Volume 3i issue 3 

Full text available: ■^^dft6M-2:lKB) Additional Information: tujh:jtaiion, ttetrsct, r6M§.a> 

Clustering is the unsupervised classification of patterns (observations, data items, or feature vect 
addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal 
However, clustering is a difficult problem combinatorially, and differences In assumptions and con 
generic co ... 

CCeywords: cluster analysis, clustering applications, exploratory data analysis, incremental cluste 



''^ EQsolving.confyctsjn 
R. H. katz', E. Wong 

March 1983 ACM Transactions on Database Systems (TODS), volume 8 issue i 

Full text available: 'gl pdiTI.68 MB) Additional Information: foil cihation . ;?bstr:;i^:t, reiferen- 

We present a conceptual framework in which a database's intra- and interrecord set access requir 
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characteristics ("evaluated," "indexed," "clustered," "well-placed") to logical access paths. We der 
structure that most closely provides the desired access characteristics. We use explicit replication 

Keywords: access path selection, functional data model, storage structure choice 



De§.[gning..and mlnin 

Alexander S. Szalay, Peter Z. Kunszt, Ani Thakar, Jim Gray, Don Slutz, Robert J. Brunner 

IVlay 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 

Full text available: ' g[t)cijM29 09 KB) Additional Information: full cllation , ;?t;strgi^:t, reii^reni 

The next-generation astronomy digital archives will cover most of the sky at fine resolution in ma 
infrared. The archives will be stored at diverse geographical locations. One of the first of these pn 
wavelength catalog over 10,000 square degrees of the sky (see http;//www. sdss.org/). The 200 r 
numerical attribut ... 

Keywords: Internet, archive, astronomy, data analysis, data mining, database, scalable 



PMQrmance.issyes:^ 

Michael Hammer, Bahram Niamir 

May 1979 Proceedings of the 1979 ACM SIGMOD international conference on Manage 

Full text available: ■g.adS122 MB^ Additional Information: Ml.cltalion, abstract, refereo< 

One technique that is sometimes employed to enhance the performance of a database managemc 
of dividing the attributes of a file into separately stored subfiles. By storing together those attribu 
by separating those that are not, attribute partitioning can reduce the number of pages that are t 
processing of a tran ... 

EjasldeieMQn.of.com 

Thomas Kunz, Michiel F. H, Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies or 

Full text available: "glpdlMJlMB) Additional Information: fuicltayon, 3^i?tFc;ct, referenc 

Understanding distributed applications Is a tedious and difficult task. Visualizations based on proo 
understanding of the execution of the application. The visualization tool we use is Poet, an event i 
diagrams are often very complex and do not provide the user with the desired overview of the ap 
occurrences of non-trivial commun ... 

ExMrna!..memory.,a^^ 
Jeffrey Scott Vitter 

May 1998 Proceedings of the seventeenth ACM SIGACT-SIGi^OD-SIGART symposium or 

Full text available: " Pl pcjfi;i.6B MB) Additional Information: UiW citation , referenc^i^ . o;tinq?>, mkix ten- 



Extensible Jle.s/MemiEU:B);^ 

John F. Karpovich, Andrew S. Grimshaw, James C. French 

October 1994 ACM SIGPLAN Notices , Proceedings of the ninth annual conference on Ob: 

applications, volume 29 Issue 10 
Full text available: •gjpdfil.M.fvl.B) Additional Information: fun citation, abstract, referea< 

Scientific applications often manipulate very large sets of persistent data. Over the past decade, c 
been outpaced by advances in the performance of the rest of the computer system. As a result, n 
run-times are dominated by the time spent performing I/O operations. Consequently, the perforn- 
performance in these applicatio ... 
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19 Semistructured Data: Structural proximity searching for large coilectjons of senii-structured c 
Michael Barg, Raymond K. Wong 

October 2001 Proceedings of the tenth international conference on Information and kno 

Full text available: 'Qpdf(lJ2.jyiB} Additional Information: fvlHcitatjon, aostrsct, rejereni 

The richness of the XML data format allows data to be structured in a way which precisely capture 
the data, however, which forms the basis of all XML query languages. Without at least some notic 
This problem is compounded when one considers that heterogeneous data adhering to different s< 
paper proposes a soluti ... 

2° Multidimensionai.ac 

Volker Gaede, Oliver Gunther 

June 1998 ACM Computing Surveys (CSUR), volume so issue 2 

Full text available: 'g.pdf(.L05 MB). Additional Information: .yj..citytion, ?Lbstract, .[:efere!}< 

Search operations in databases require special support at the physical level. This is true for conve 
search operations include the point query (find all objects that contain a given search point) and t 
region). More than ten years of spatial database research have resulted in a great variety of multi 

Keywords: data structures, multidimensional access methods 
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